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Abstract 

We consider the spatial A-Fleming-Viot process model ( [BEVlOj ) for frequencies of ge- 
netic types in a population living in R'', in the special case in which there are just two 
types of individuals, labelled and 1. At time zero, everyone in a given half-space has type 
1, whereas everyone in the complementary half-space has type 0. We are concerned with 
patterns of frequencies of the two types at large space and time scales. We consider two 
cases, one in which the dynamics of the process are driven by purely 'local' events and one 
incorporating large-scale extinction recolonisation events. We choose the frequency of these 
events in such a way that, under a suitable rescaling of space and time, the ancestry of a 
single individual in the population converges to a symmetric stable process of index a G (1,2] 
(with a = 2 corresponding to Brownian motion). We consider the behaviour of the process 
of allele frequencies under the same space and time rescaling. For a = 2, and cf > 2 it 
converges to a deterministic limit. In all other cases the limit is random and we identify it as 
the indicator function of a random set. In particular, there is no local coexistence of types in 
the limit. We characterise the set in terms of a dual process of coalescing symmetric stable 
processes, which is of interest in its own right. The complex geometry of the random set is 
illustrated through simulations. 



On etudie le processus A-Fleming-Viot spatial (-BEVIO ) modelisant les frequences locales 
de types genetiques dans une population evoluant dans Mf^. On considere le cas particulier oil 
il n'y a que deux types possibles, notes et 1. Initialement, tons les individus presents dans 
le demi-espace des points dont la premiere coordonnee est negative sont de type 1, tandis 
que les individus presents dans le demi-espace complementaire sont de type 0. On s'interesse 
au comportement des frequences locales sur des echelles de temps et d'espace tres grandes. 
On considere deux cas : dans le premier, revolution du processus est due uniquement a des 
evenements 'locaux'; dans le second, on incorpore des evenements d'extinction et recoloni- 
sation de grande ampleur. On choisit la frequence de ces evenements de sorte qu'apres une 
renormalisation spatiale et temporelle appropriee, la lignee ancestrale d'un individu de la 
population converge vers un processus a-stable symetrique, d'indice a G (1,2] (oii a = 2 
correspond au mouvement brownien). On etudie revolution du processus des frequences 
alleliques aux memes echelles spatio-temporelles. Lorsque a = 2 et d > 2, celui-ci converge 
vers un processus deterministe. Dans tous les autres cas, le processus limite est aleatoire 
et on I'identifie comme la fonction indicatrice d'un ensemble aleatoire evoluant au cours du 
temps. En particulier, les deux types ne coexistent pas a la limite. On caracterise chaque 
ensemble en termes d'un processus dual constitue de mouvements stables symetriques coa- 
lescents ayant un interet en eux-memes. La geometrie complexe des ensembles limites est 
illustree par des simulations. 

AMS 2010 subject classifications. Primary: 60G57, 60J25, 92D10 ; Secondary: 60J75, 
60G52. 
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1 Introduction 

In this article, we are interested in the behaviour over large space and time scales of the spatial 
A-Fleming-Viot process (or SLFV) on M . This process arises as a particular instance of the 
framework introduced in [EthOSl IBEVlOl IBKElOj for modelling allele frequencies (that is fre- 
quencies of different genetic types) in a population that evolves in a spatial continuum. From the 
modelling perspective, this framework is interesting as it overcomes an obstruction to modelling 
biological populations in continua, dubbed 'the pain in the torus' by Felsenstein ( jFel75] ). which 
is typified by the 'clumping and extinction' seen in spatial branching process models in low di- 
mensions. The key idea of the SLFV framework is to base reproduction events on a space-time 
Poisson process rather than on individuals in the population. In this way one can define what 
can be thought of as a continuum version of the Kimura stepping stone model ( [ Kim53j ) which 
is a widely accepted model for evolution of allele frequencies in spatially subdivided populations. 
Moreover, one can incorporate large-scale extinction-recolonisation events through a series of 
'local' population bottlenecks, each affecting substantial portions of the species range. Such 
events dominate the demographic history of many species and, as we shall see in our results 
here, can have a very significant influence on patterns of allele frequencies. 

From a mathematical perspective, the SLFV process is a natural extension to the spatial 
context of the generalised Fleming- Viot processes which can be traced to §3.1.4 of |DK99] but 
were first studied in detail by Bertoin & Le Gall ( [BLGOS] ) . These processes are dual to the so- 
called A-coalescents which have been the subject of intensive study since their introduction over 
a decade ago by Donnelly & Kurtz, Pitman and Sagitov ( |DK991 IPit991 |Sag99| ). The duality 



with the generalised Fleming- Viot processes extends that between the Kingman coalescent and 
the Wright-Fisher diffusion and our work here will exploit a similar duality between spatial 
versions of the A-coalescents and the SLFV. One of the attractions of the SLFV processes is 
that they allow us to capture many of the features of Wright-Fisher noise, but in any spatial 
dimension (whereas stochastic partial differential equations driven by Wright-Fisher noise only 
make sense in one dimension). Thus, although they were originally motivated by purely biological 
considerations, we believe that these models are also of intrinsic mathematical interest. 

1.1 The spatial A-Fleming-Viot process 

First we describe the model. Each individual in the population is assigned a genetic type, 
from a compact space K, and a location, in R'^. At time t, the population is represented by a 
measurable function pt : M — )• A4i{K), where A4i{K) is the set of all probability measures on 
K. (In fact, as explained in ^ in defining the state space, H, of the process we identify any 
two such functions that are equal for Lebesgue-a.e. x G M .) The interpretation of the model is 
as follows: the population density is uniform across M'^ and, for each a; G M'^, if we sample an 
individual from x, then its genetic type is determined by sampling from the probability measure 
Pt{x)- 

The dynamics of the population are driven by a Poisson point process, LE, on M x M'^ x (0, c») x 
[0,1], each point of which specifies a (local) extinction-recolonisation event. If {t,x,r,u) G 11, 
then, at time t: 

1. An extinction-recolonisation event affects the closed ball B{x,r) C M^, and nothing hap- 
pens outside this region. 

2. A parent is chosen uniformly in the ball; that is, we sample a location z uniformly at 
random over B{x,r) and a type k according to the distribution pt-{z). 



3. For each y G B{x,r) (including z), a fraction u of the local population is replaced by 
offspring, whose type is that of the chosen parent. That is, 

Ptiy) ■■= (1 - u)pt-{y) + u6k. 

Here, we are thinking of reproduction events as equivalent to (frequent) small-scale extinction- 
recolonisation events. 

In [BEVlOj . the intensity measure of the Poisson point process 11 has the form dt ® dx ® 
C{dr,du), thus allowing the 'impact', u, of a reproduction event to depend on its radius, r. For 
instance, small-scale reproduction events may affect only a tiny fraction of individuals, compared 
to massive extinction-recolonisation events which could wipe out most of the population in a 
large geographical region. Of course, we require some conditions on the intensity of 11 if our 
process is to be well-defined: according to Theorem 4.2 of [BEVIO] (stated for d = 2, but the 
proof is identical for any dimension d > 1), the corresponding spatial A-Fleming-Viot process is 
well-defined whenever 

/ C{dr, du) uVr < oo, (1) 

J(0,oo)x[0,l] 

where Vr denotes the volume of a d-dimensional ball of radius r. 



1.2 Main results 

Our previous mathematical analysis of the SLFV process ( |BEV10"l IEV11| ) has been concerned 
with understanding the genealogical relationships between individuals sampled from the popu- 
lation. Here, although studying the lineages ancestral to a sample from the population will be 
fundamental to our analysis, we are interested in understanding the patterns of allele frequencies 
that result from such a model. 

Saadi ( [Saall| ) considers a closely related model (which differs from ours only in that the 
location of the 'parent' in a reproduction event is always taken to be the centre of the event). 
He considers the most biologically interesting case of two spatial dimensions and, for simplicity, 
takes all reproduction events to have fixed size r and fixed impact u G (0, 1]. Notice that if a 
particular genetic type is present in a region at some time t, then, unless n = 1, it will also 
be there at all later times. Saadi shows that if a particular genetic type is only present in a 
bounded region at time zero, then, with probability one, its range, that is the region in which it 
is ever seen is bounded. On the other hand, the shape of this region will be complex. In order 
to try to gain some understanding of the boundary of the range, he has also simulated a simpler 
situation. The idea is to consider just two 'competing' types on a two-dimensional torus which 
we can identify with {—L,L]'^ C M^. At time zero, all points of the torus with a non-positive 
first coordinate are of one type and all with a strictly positive first coordinate are of the other 
type. The region in which both types coexist develops in a rather complicated way, but it is 
natural to ask whether if one 'stands back' and views the process over large spatial scales (at 
sufficiently large times) a simpler pattern emerges. Saadi's simulations were the starting point 
for our work here. 

We shall concentrate our attention on two special cases of the SLFV model, in both of which 
individuals can be one of only two genetic types, labelled and 1. Evidently it is then enough 
to consider the proportion of type-1 individuals at each site and so we define, for every x S M 
and t > 0, 

w{t,x):=pt{x){{l}). (2) 



For the sake of clarity, we shall also take the fraction u G (0,1] to be the same for all events. In 
our previous notation, this corresponds to taking ((dr,dv) = ii{dr)5u{dv), for a measure /x on 
(0, oo). We shall allow the measure /i to take two forms: 

Case A (fixed radius): We fix r G (0, oo), and choose /x to be the Dirac mass at r. 

Case B (heavy-tailed distribution): We fix a G (1,2) and define the measure /i by 

^,{dr)=r-''-''-\r>l}dr, (3) 

where we recall that d is the dimension of the geographical space. 

It is easy to check that the condition ([1]) which guarantees existence of the SLFV process is 
satisfied in both cases. 

Case A bears some similarity to the nearest-neighbour voter model, in that an individual 
spreads its type (/opinion) in a 'close' neighbourhood. Case B incorporates some large-scale 
events and consequently, as we shall see, behaves very differently. The particular form of n 
is motivated by the fact that with this choice, under a suitable rescaling of space and time, 
the motion of an ancestral lineage will converge to a symmetric a-stable Levy process (and, 
more generally, the ancestry of finitely many individuals converges to a system of coalescing 
dependent a-stable processes, see ^. Combined with duality, this will imply that with the 
same space-time rescaling, the forwards in time process of allele frequencies will also converge 
to a non-trivial limit. 

Suppose that the initial condition of the process is 

w{0,x) = l{x(i)<o}, 

where here again xn\ denotes the first coordinate of x. In words, we start from a half-space H 
of I's. Let us set a = 2 in Case A, and, for a given a G (1, 2] and any n G N, define the rescaled 
density tu" by 

tt;"(t, x) := w{nt, n^/"x), t > 0, x G M'^. 

We denote by p" the H-valued process whose local density of I's at time t is w^{t, •). Our main 
results are the following two theorems, which describe the asymptotic behaviour of p" as n tends 
to infinity. In Case A, a'^ is the variance of the displacement, after one unit of time, of a single 
ancestral lineage from its starting point (see (O). 

^^ (2) 

Theorem 1.1. (Case A) There exists a 3-valued process {p\ , t > 0} such that 

p"' — > p^ ' as n — )• oo, 

in the sense of weak convergence of the (temporal) finite- dimensional distributions. 

Furthermore, at every time t > 0, the local density w^'^'{t, •) := pl ({1}) of type -1 individuals 
can be described as follows. If X denotes standard d-dimensional Brownian motion and 

p^{t,x) := Px-[A^2t (^H], t > 0, X G M'^, 

then: 

1. If d = 1, for every t > and a.e. a; G M, w^'^'{t,x) is a Bernoulli random variable 
with parameter p'^{t,x). The correlations between their values at distinct sites of M. are 
non-trivial and are described in Iil5\) . 



2. If d> 2, for every t >0 and a.e. x G M , 'w^'^'{t,x) is deterministic and equal to p'^{t,x). 

Remark 1.2. Note that, in one dimension, the two types almost surely do not coexist at any 
given point, since w^'^'{t,x) is a Bernoulli random variable. However, in higher dimensions, the 
two types and 1 do coexist at every site instantaneously. 

Remark 1.3. Although we have expressed everything in terms of densities, the convergence in 
Theorem II. 1^ which we define explicitly in ^ is equivalent the convergence for all j > 1 of the 
measures 

Pt{xi, dKi) . . . Pf{xj,dKj)dxi . . . dxj. 

(This is the vague convergence of nonnegative Radon measures on (M'^ x {0, 1})-'.) See [VWllj 
for a measure- valued formulation of the SLFV and for a proof of this equivalence. 

Remark 1.4. The quantity p'^{t,x) implicitly depends on the dimension. Also, since u and r 
are fixed, substituting in d?]). 



2 _ "" / J^ l^|2 



a^ = / dz \z\'^Lr{z) ( = when d = 1 

dVr 

is finite and proportional to ti. Indeed, Lr{z) := Vol(i?(0, r)ni?(0, z)) = (2r— |z|)_|_ in dimension 
1 and, more generally, Lr{z) < l{|z|<2r}K' for any d > 1. 

In contrast to the case of fixed radii, in Case B, in the limit as n — )• oo types are always 
segregated, irrespective of dimension. 

Theorem 1.5. (Case B) There exists a r.-valued process {p" \ i > 0} such that 

p" — > p^°^ as n -^ oo, 

in the sense of weak convergence of the (temporal) finite- dimensional distributions. 
Furthermore, there exists a symmetric a-stable process X°' such that if 



p"(x, t) := F^ [X^t £H], t>0, x£ 



r>d 



then for every f > and a.e. x G M'^, w^'^>{t,x) is a Bernoulli random variable with parameter 
p°'{t,x). The correlations between the values of the densities at different sites (and at the same 
time t) are again given by ( f75|) (or ^), where the process ^°° is now the system of coalescing 
a-stable processes obtained in Proposition \5.1\ 



Here again, one should notice that the speed of evolution of the limiting process is propor- 
tional to the parameter u. 

Comparing the results of Theorem II . II and Theorem ll.51 one can see that very large extinction- 
recolonisation events create correlations between local genetic diversities over a much larger 
spatial scale (n^'" >> ^/n) than purely local reproduction events. This is because an ancestral 
lineage can move a distance 0{n^'°') over the course of n generations. One might initially guess 
that, since the motion of a single ancestral lineage under our rescaling converges to a symmetric 
stable process, two distinct ancestral lineages would (asymptotically) only meet (and thus have 
a chance to coalesce) in dimensions where the stable process hits points. This is precisely what 
we see in Case A and, in that case, lies behind the deterministic limit in d > 2. However, this is 
where the dependence between ancestral lineages in the SLFV process (see ^3.2p comes into play. 
The detailed analysis of the ancestral process for Case B (which we present in ^ reveals that 



'very large' events are frequent enough to capture lineages that have moved to arbitrarily large 
separations. In particular, Lemma 15.31 shows that, in Case B, any finite sample of individuals 
will find its most recent common ancestor in finite time a.s. (see also Remark 15. 4p . The large 
events will, momentarily, create extensive areas in which the two genetic types coexist. Our 
analysis will also show that, under our rescaling, 'small' events then occur sufficiently quickly to 
instantaneously restore the allele frequencies in each infinitesimal region to or 1 (see also the 
simulations presented in ^. 

The rest of the paper is laid out as follows. In ^ we present some simulations that illustrate 
the results and the mechanisms underlying them. In ^ we are explicit about the meaning of 
'weak convergence of the (temporal) finite-dimensional distributions' and we describe the duality 
between allele frequencies and ancestral processes that provides the main tool in our proofs. It 
is then used to find conditions, expressed in terms of the genealogical trees relating individuals 
in a sample from the population, under which w^'^'{t,x) (at each time t > and a.e. point 
X G M ) takes the particular forms seen in our main theorems (see Lemma l3.2p . Theorems ll.il 
and 11.51 are then proved in 21 'ind ^ respectively. This last section also contains some results 
(Lemma 15.31 and the accompanying remark), of independent interest, on the system of coalescing 
(dependent) Levy processes that generates the genealogical trees relating a sample of individuals 
from the limiting population. 

2 Simulations 

Our results show that in the cases where the rescaled density of type 1 individuals converges to 
a random limit, at any fixed time that limit takes the form of the indicator function of a random 
set. In one dimension, provided that either n = 1 or a = 2 (the radius of events is fixed), the set 
takes a simple form, but for a S (1,2) this is no longer the case. In this section we present some 
simulations that illustrate the complex geometry of the limiting random sets and the mechanism 
that leads to their creation. We are extremely grateful to Jerome Kelleher from the University 
of Edinburgh for performing these simulations and producing the figures. 

First suppose that we are in one spatial dimension. If u = 1, then at every stage of the 
rescaling we will have w^{t,x) = li"(x) where /" is a half-line with right endpoint Rt following 
a random walk on M. Under our rescaling, as n — )• cxd, the process Rt will converge to a Brownian 
motion if q = 2 and to a symmetric stable process of index a for a G (1, 2). If q = 2, and d = 1, 
then the same is true for n < 1. This can be understood via the dual process of ancestral lineages. 
As we shall see, this converges to a system of independent Brownian motions which coalesce 
instantaneously on meeting. The type of an individual sampled at x at time t is determined by 
the type at time t before the present of the corresponding ancestral lineage. Since the Brownian 
motions are continuous, and they coalesce as soon as they meet, it is impossible for two lineages 
to 'cross over'. Consequently, asymptotically, if a lineage started from x traces back to a point 
to the left of the origin at time t before the present, then so must all lineages started from 
points to the left of x. As a result, at time t, the density of type Is will still be the indicator 
function of a half-line. The boundary, Rt, moves in the same way as a single ancestral lineage, 
that is as a Brownian motion with a clock that runs at a rate proportional to u. Figure [1] shows 
the results of a simulation of the process of allelic types in this case. In two dimensions, two 
Brownian motions won't meet and so for a = 2, asymptotically, the ancestral lineages will just 
look like independent Brownian motions and forwards in time, asymptotically, allele frequencies 
are smeared out by the deterministic heat flow. 

The case a £ (1,2) is much more interesting. Now, even in the limit, ancestral lineages evolve 
in a series of jumps and if n < 1 they can 'cross over'. Thus although our results show that the 
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Figure 1: Fixed radius in d = 1 on a line of length 20. (a) initial conditions; (b) after 10^ events; 
(c) after 10 events. The model parameters are u = 0.8, r = 0.033, n = 10^. 

limiting allele frequencies always look like the indicator function of a random set, even in d = 1 
we can no longer expect that set to be connected. Forwards in time what our results suggest, and 
simulations confirm, is that a large event can create a region in which allele frequencies are strictly 
between zero and one, but these frequencies are rapidly (and asymptotically instantaneously) 
'resolved' by 'small' events so that the state is restored to being the indicator function of a set. 
Figure [2] shows how on the line this mechanism leads to allele frequencies that look like a series 
of 'crenellations'. Even in one spatial dimension, our methods are not powerful enough to allow 
us to capture detailed information about the random sets observed in the limit. 

Figure [3] illustrates the same mechanism in two spatial dimensions. To isolate the effect in 
which we are interested, we suppose that a large event covers a previously unblemished portion 
of the interface and observe the resolution of the resulting patch of coexistence. 



3 Convergence and duality 

3.1 State-space and form of convergence 

In order to make the convergence in Theorems 11.11 and 11.51 explicit . let us recall some facts about 
the state space of the SLFV from [BEVIO] . In m.l\ we described the process as taking its values 
in the set S of all measurable functions p : W^ ^ Aii (K) (where the compact type space K is 
now {0, 1}). In fact, we need to define an equivalence relation on this space by setting 



p~p 



^ Vol({x G M'^ : p{x) / p{x)}) = 0. 



The state-space E of the SLFV is then defined as the quotient space H/ ~ of equivalence classes 
of ~. 



H is endowed with a natural topology described in §3 of |Eva97| . As mentioned earlier, it is 
possible to check that this topology is equivalent to the topology of vague convergence of the 
measures p{xi, dKi) . . . p{xj,dKj)dxi . . . dxj for all j > I as Radon measures on (M*^ x {0, 1})-'. 
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Figure 2: Variable radius in d = 1 on a line of length 20. (a) initial conditions; (b) after 100 
events, full range; (c) after 100 events, zooming in; (d) after 10^ events, full range; (e) after 10^ 
events, zooming in. The model parameters are u = 0.8, n = 10^ and a = 1.3. 
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Figure 3: Model in d = 2 after (a) 10^; (b) 10^; and (c) 10^ events. We have a square range 
of edge 8, and the initial patch is a circle of radius 4 with frequency 0.8 (white is frequency 1, 
black is 0). The model parameters are u = 0.8, a = 1.3 and n = 10^. 



In practice, Lemma 4.1 in [BEVIO] provides us with a family of functions which is dense in the 
set of continuous functions on S. To introduce this class of functions, for a space E, let C{E) 
denote the set of all continuous functions on E and, for a measure v, let L^{i^) be the set of all 
functions which are integrable with respect to u. For every j G N, "0 £ C{{M.'^y) n L^{dx®^) and 
Xi, ■ ■ ■ iXj £ C{K), we define the function Ij{- ; tp, {xi)i<i<j) as follows. For every /J E S, 



ij{p; '^,{xi)i<i<j) ■-- 



/ dxi ...dxj 'ip{xi,...,Xj) T\(xi,Pixi)) , 

JiR'^y Vr=i / 



where (/, u) is the integral of the function / with respect to the measure v. 
Since in our setting K = {0, 1}, we have, for every x, 

{X,p{x)) = x{'^)w{x) + x(0)(l - w{x)) 

= (x(i)-x(o))^(x) + x(o), 

where, as before, w^x) := p(x)({l}) denotes the mass of I's at site x. We can therefore restrict 
our attention to the set of functions Ij such that Xi = M for every i G {1, . . . , j'}, that is 



/ dxi ...dxj il){xi,... ,Xj) Y\w{xi) 

V(M^). Vr=i / 



/j(p;V)=/ dxi...dxj ij{xi,...,Xj) {\\w{xi)] . (4) 

V(M^). Vr=i / 

Indeed, any Ij{- ; tp, {xi)i<i<j) can be written as a finite linear combination of functions of the 
form ^ . The convergence stated in Theorems 11.11 and 11.51 can now be expressed for a single 
time t > as: for every j E N and V G C{(E.'^y) H L^{dx®i), 

Jim E[/,(pr;^)] =£[/,(/,("); V')]. (5) 

The extension of this definition of convergence to joint convergence at several times ti, . . . , t^ is 
straightforward. 

3.2 Duality between the SLFV and its genealogies 

The proofs of Theorems 11.11 and 11.51 rely on a duality relation between the SLFV process, and 
the system of coalescing jump processes that we call the genealogical process of a sample of 
individuals from the population. We recall this relation in the particular form in which we shall 
need it. In particular we restrict our attention to i^ = {0, 1}. A more general statement (and 
proofs) can be found in §4 of [BEVlOj . 

First suppose that we wish to trace the ancestry of a single individual alive in the current 
population. Let us, for now, work in a general setting as it will allow us to understand condi- 
tion ([1]) a little better. Since the model is translation invariant, without loss of generality we 
may suppose that the individual is currently at the origin in M . Tracing backwards in time, 
at the first time in the past when is in the area B(x,r) affected by a reproduction event, 
our individual has probability u of being an offspring of that event, in which case the ancestral 
lineage jumps to the position of the parent (which is uniformly distributed on B{x, r)). Since the 
Poisson process driving events is reversible, we see that the rate at which our ancestral lineage 
experiences a jump is 



C{dr,du)dx l^Q^B{x,r)}'u = / C{dr,du) uVr- 

R"* J(0,oo)x[0,l] ' J(0,oo)x[0,l] 

10 



By translation invariance in time and space of the law of 11, this tells us that the quantity in 
([1]) is just the instantaneous jump rate of an ancestral lineage (at any time and any location), 
and we are requiring it to be finite. We refer to §4 in |BEV10| for an explanation of why this 
guarantees existence and uniqueness of the process {pt)t>o- 

We will need to be more precise about the law of the compound Poisson process followed 
by an ancestral lineage and so we now establish the rate at which it jumps from to z (or, by 
translation invariance, from y to y + z). In order for such a jump to occur, first and z must 
both belong to the area hit by the event; second our lineage at must belong to the fraction u 
of individuals replaced; and third the parent must be chosen from site z. The intensity measure 
of the jump process is therefore equal to 

m{dz) = 11 C{dr,du)dxl{^(^B{0,r)nB(z,r)}UTir 

'(O,oo)x[0,l] ^T 




Cidr,du)^^^]dz, (6) 

(O,oo)x[0,l] ^r J 

where Lr{z) denotes the volume of the intersection -6(0, r) n B{z, r). (To see this, note that for 
an event of radius r to affect both and z, its centre, x, must lie in the region B{0, r) n B[z, r) 
and that since the parent is chosen uniformly from the region, the factor l/Y,- arises as the 
density of the uniform distribution on B(x,r).) In particular, by rotational symmetry, in the 
special case where the variance of the displacement of a lineage over one unit of time is finite, 
its covariance matrix is of the form cr^Id, with 



a 



m{dz) (^^(1)) = "7 / rn{dz) \z\ (7) 



(here, zn) denotes the first coordinate of z, and \z\ its L^-norm). 

Much of our analysis will rest upon understanding the ancestry of (larger) samples from 
the population, and these can be established in much the same way as the motion of a single 
ancestral lineage. If we sample k individuals (possibly from the same location), the ancestry is 
given by a system of (finite-rate) jump processes, which are a priori correlated, since their jumps 
are generated by the same Poisson point process of events. Furthermore, if at least two of them 
are encompassed by the same event and lie within the fraction of the local population replaced, 
then these lineages trace back to the same parent and thus merge into a single lineage during 
the event. Tracing further back in time, that single lineage and all other remaining lineages 
continue to evolve in the same manner. Note that \i u < 1, there may be other lineages in the 
ball where the event takes place, but not in the sub-population replaced. Such lineages neither 
jump nor coalesce during the event. 

Let {At)t>o be a system of finitely many (the initial number will always be specified explicitly) 
ancestral lineages as described above. That is, each lineage follows a finite-rate jump process 
with jump intensity dG]), and two or more lineages coalesce whenever they are affected by the 
same event. See Equation p4p in ^ for an expression for the generator of this process, in 
the particular case where u is fixed. For every t > 0, let us write Nt for the number of distinct 
lineages at time t, and ^^^, . . . , ^^ * G M for their spatial locations at that time. The weak duality 
relation we shall use in the sequel is also based on the family of functions in ([1]) , and states that 
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for every j G N and V G C{{R'^y) D L^{dx'^^), we have, for every t > 0, 

dxi . . . dxj i^{xi, . . . , Xj) K[w{t, xi) . . . w{t, Xj) | w{0, •) = wq] (8) 

dxi... dxj ipixi, ..., Xj) E[u;o(Ct ) • • • "^ofe^*) \No= j, ^^ = xi, . . . ,^1 = Xj] . 



Since ^ is valid for all functions ip as above, we also have for Lebesgue-a.e. (xi, . . . ,Xj), 

E[w{t,xi) . . . w{t, Xj) I u;(0, •) = wq] =¥.[wQ{il) ■ ■ ■ M^t') \ ^o = j, Co = xi, . . . ,CH ^j] ■ 

(9) 

Remark 3.1. The weak duality in Q is very similar to the duality between the Kimura stepping 
stone model and a system of coalescing random walks (see e.g. Chap. 6 of [Ethllj ). Here, 
however, in contrast to the discrete space setting, we cannot deduce an expression for the 
second or higher order moments of the it;(t,x)'s since ([9]) only holds for Lebesgue-a.e j-tuple 
{xi, . . . ,Xj) (and the Xj's are pairwise distinct for Lebesgue-a.e. vector). The problem stems 
from the fact the actual object with which we are dealing is the random measure w{t, x)dx and 
not the collection {w{t, x)}^^^d. The topology on H is too weak to consider the evolution of the 
density of I's at every single point, and we are obliged to characterize this density through a 
local averaging procedure, see p^ . 

Thanks to ([8|), proving the convergence of p" = {w{nt,n^'"x)},j.^^d boils down to showing 
that the genealogical process relating a finite sample of individuals converges, and to transferring 
the result to the forwards-in-time process. In addition, these duality relations enable us to obtain 
an explicit description of the local densities w^'^'{t, x). Indeed, ([5]) and ([8]) lead us to an implicit 
characterisation of the limiting random field p'"' through the values of 

Er/,(/3j"^ V)] =E / dxi...dxj ij{xi,...,Xj)(t\w^''\t,Xj)] 

However, the following result gives us more information on the form of the w'"^(t,x)'s. 

Lemma 3.2. Suppose that {pt)t>o is a '^-valued process dual to an exchangeable and consistent 
system of coalescing Markov processes {At)t>o through the relations (8\). Let {^t)t>o denote the 
Markov process followed by a single lineage, and suppose that the initial condition of p is such 
that for every t > 0, the map z i— )• 'Kz[w{{),$^t)\ is continuous on M (where as usual E^ denotes 
expectation under P^J. 

(i) If for every e > we have 

lim P [lineages 1 and 2 have not coalesced by time £ \ ^l = x, ^q = y] = 0, (10) 

\y-x\^0 

where the convergence is uniform with respect to x ^ W^, then for every t > and a.e. 
X G M'^, w(t,x) is a Bernoulli random variable with parameter Kx[w{0,^t)]- 

(a) If {At)t>o is a system of independent Markov processes which never coalesce whenever they 
start from distinct locations, then for every t > and a.e. x G M , w{t,x) is deterministic 
and equal to E,x[w{0,S,t)]- 
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Here, by 'exchangeable' we mean that the law of (^t)t>o is invariant under relabelling of the 
initial lineages; 'consistent' means that for every j E N, if ^ starts with j + l lineages but we only 
follow the evolution of the first j of them, we obtain a system of coalescing Markov processes 
that has the same law as A started with only j lineages. In other words, the evolution of the 
(j + l)-st lineage does not influence that of the other j. It is not difficult to see that the system 
iA.t)t>o introduced at the beginning of this section is indeed exchangeable and consistent (since 
each lineage present in the area hit by an event is affected with probability u independently of 
all others). The limiting genealogies we shall obtain will inherit these properties. 

Proof of Lemma 13.21 Let us fix t > 0, and consider the random measure £{dx) on M*^ defined 
by: for every nonnegative measurable function tp, 



I 

JW 



£{dx) ipix) 



dx ■ip{x)w{t,x) 



(11) 



Notice that, according to the description of E given in §3.11 wit,-) := pt{-){{l}) is in fact an 
equivalence class of functions of the form w : M — >• [0, 1]. Two representatives of w{t, •) differ 
only on a Lebesgue negligible subset of W^. For the rest of this proof we assume that for 
every oj in the probability space (il, J-", P) on which pt is defined, we have fixed a representative 
w{u}) : W^ — )• [0, 1] of w{uj, t, •) and define i{uj, dx) as in pT]) . with w{u, t, •) replaced by w{uj, •). 
Let {(pm)ineN be a sequence of continuous functions on M'^ such that for every m, < ipm < 1, 
ifm = 1 on B{0,l/m) and ipm = outside B{0,2/m). Let us write ipm{^'^) for the integral 
Jj^d dz ipm{z). Since w is locally integrable (it has values in [0, 1]), the Lebesgue Differentiation 
Theorem guarantees that for every a; G il, there exists a Lebesgue null set M{uj) such that for 
every x ^ M{uj), 

^™ mrf\ / ^{^.dz) ipra{x + z) =w{uj,x). (12) 

Consequently, by Pubini's theorem there exists a Lebesgue null set O such that for every x ^ O, 
the convergence in (J12p occurs with P(da;)-probability one. Evidently, if we can show that the 
random variable w{x) is as in the statement of Lemma 13.21 for every x ^ O, we shall obtain the 
desired result for w(t, •). 

Now fix X G M*^ \ O, so that ()12p holds P-a.s. We show that w{x) is a Bernoulli random 
variable under the condition stated in (i), and a deterministic constant under the condition 
given in (ii). Let j £ N. On the one hand, the Dominated Convergence Theorem yields that 



lim E 



^■n 



£{dz) (pm{x + z] 



E[ 



w[x 



On the other hand, by Fubini's theorem and ([8]), we have that for every m G N 



E 



95m (,i 



'^)-i / i{dz)^Ux + z) 

^'^y^ I dzi... dZj ipm{x + Zi) • • • ipm{x + Zj) E[w{t, Zi) . . . w{t, Zj)] 



"y / dzi...dzjE[w{0,C})---w{0,^f')\No=j, eo = zi,---,Ci 



(frn{x + Zi) ■ ■ ■ ipmix + Zj). 



(13) 



Zi X 



(14) 
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Since the Lebesgue measure of the set of j-tuples with at least two identical coordinates is 0, 
under the condition of (ii) the quantity in the right-hand side of p^ is equal to 









By our continuity assumption, this quantity tends to Ea;[tt;(0,^j)]-' as m — )• oo. Combined with 
()13p . this gives us that w{x) is a.s. equal to the constant 'Ex['w{0,S,t)] under the condition stated 
in (ii). 

To see (i), consider the case j = 2 (i.e., A consists of two ancestral lineages) and let us write 
r for the time at which they coalesce, with the convention that r = oo if ^ always contains 
two lineages. Since (pm{x + •) is concentrated on B(x, 2/m), using (jlOp we obtain that for every 
e>0, 

1™ Ti^^l^ / dzidZ2 (Pmix + Zi)(frn{x + ^2)!? W > e\No=2, ^0 = 21,^0 = ^2] = 0. 

Hence, for j = 2 and e < t, the quantity on the right-hand side of ()14p can be written 

/ dzidz2E[w{0,C^)lf^<^y\No = 2, ^^ = zi,S,q = Z2\ —^-^ + 6{e,m), 



where d{£, m) — )■ as m — )■ 00 for every fixed e. By the same argument, we have 

^ A W\ ^n elM \aT O fl cS .iPm{x + Zi)Lpm{x + Z2) 

'-Prn y^ ) 

dzldz2^[w{{),$,^)\NQ = 2, ^g = 2:1,^0 = ^2] j^^Tyj + 5{e,m) 

(M<*)2 ^m\)^ ) 

dzidz2 E^i [w{0, Ct)] T^^^Tn + S e, m) 

where 5'{£,m) also tends to as m — )• 00 for every e > 0, and the third line is justified by the 
consistency of {At)t>o- Using again our continuity assumption on z 1— )• E2[w(0,^j)], we obtain 
that under the condition stated in (i), the quantity on the right-hand side of l\14\i converges to 
Ea;[t(;(0,^i)] as m — )• 00. Hence, coming back to (I13p . we arrive at 

E[wixf] = E,[t/;(0,6)] =E[wix)]. 

Since w{x) G [0, 1] almost surely, we deduce that w{x) G {0, 1} almost surely, whence 'w{x) is a 
Bernoulli random variable. This completes the proof of Lemma 13.21 (i). D 

Note that (ii) corroborates a remark at the beginning of §5 in |Eva97 ] . In Evans' construction, 
all the genealogical processes used as duals are made up of independent Hunt processes that 
coalesce instantaneously upon meeting. Evans points out that, in this case, if ^ and ^' are 
two independent processes having the same law as the motion of a single lineage, then the 
corresponding H-valued process evolves deterministically iff 

Vol({(zi,Z2) G (K'^)' : P.i,.2[3 t > : 6 = ^I] > o}) = 0. 
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That is, if the set of pairs of starting points {zi,Z2) such that ^ and ^' have a positive chance 
to meet in finite time is neghgible with respect to Lebesgue measure, then for every t > 0, pt is 
a deterministic function of its initial value (and so is w{t, •)). Our proof of Lemma 13.21 gives an 
alternative proof of Evans's remark when the type-space K is {0, 1}. 

4 Proof of Theorem 11.11 

Let us start by proving the convergence stated in Theorem II .11 for a single time t > 0. Since we 
start from w{0, •) = 1_h'(-) (where H (ZMJ^ is the half-space of all points whose first coordinate is 
non-positive), for every n € N we have ^(0, ■\/n) = 1h{')- Hence, we need only prove the result 
for t > 0. 

Prom our definition of convergence (see (H])), our aim is to show that for every j G N and 

i;eC{{R'^y)nL^{dx^^), 



limE 

n—^oo 



dxi . . . dxj ij{xi, . . . ,Xj)w{tn,xi\/n) • • • w{tn,Xj\/7i) 



E 



dxi . . . dxj ip{xi, . . . ,Xj)w^ ' {t,xi) ■ ■ -w^ '{t,Xj) 



As we explained in ^3.21 this question boils down to establishing the asymptotic behaviour 
of 

dxi...dxjij{xi,...,Xj)E[w{0,^lJ---w{0,^^''^) \ Nq = j,^l = xi^/n, . . . ,il =Xj^/ri\. 

This will be achieved in Lemmas 14.11 and 14.21 below, but first we need some notation. Recall that 
^ represents the motion of a single ancestral lineage, that is .^ is a compound Poisson process in 
which jumps from x to x + z have intensity 

m{dz) = — ^ — dz. 

V y 

Observe in passing that this intensity is whenever \z\ > 2r (since the start and end points of 
a jump must belong to the same ball of radius r and so the size of this jump is bounded by 2r). 
For every n E N, let ^" be the process on W^ defined by 

er := ^ Ctn, t > 0, 



and let A^ be the corresponding rescaling of A in which time is multiplied by n and spatial 
locations are scaled down by ^/n. More formally, here we view A^ and A as having values in 
the state-space Um>o{"''} ^ (K'^)™'; thus the first coordinate of A or An indicates the number 
of distinct lineages, while the remaining coordinates give their respective positions. (In Section 
[5] we will find it convenient to enrich the state space to also record the whole genealogical 
information) . 

Lemma 4.1. If d = 1, for every j G N and xi, . . . ,Xj G M , the process A^ starting from j 
lineages at locations xi,...,Xj converges, in the sense of finite- dimensional distributions, to a 
system A°° of independent Brownian motions with clock speed o"^ that coalesce instantaneously 
upon meeting. 
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More generally, let k £ 'N and < ti < . . . < tk- Suppose that we start A^ with jo lineages 
at distinct locations xo,i, • • • ,xo,jo, let the process evolve until time ti, add to the surviving 
lineages j'l lineages at distinct locations xi.i, . . . ,xi.j^, let all resulting lineages evolve until time 
t2 when we add J2 further lineages, and so on. Call the corresponding process A^ . Define A.^ 
analogously. Then for any t > 0, the law of A^ converges to that of Af as n tends to infinity. 

Lemma 4.2. If d > 2, for every j € N and distinct xi, . . . ,Xj S M , the process AP' starting 
from j lineages at locations xi, . . . ,Xj converges to a system of independent Brownian motions 
with speed o^ . In particular, the limiting lineages never coalesce. 

More generally, define A"" and A°° as in Lemma \4.1\ Then for any t > 0, the law of A^ 
converges to that of A^ as n tends to infinity. 

We postpone the proofs of Lemmas 14.11 and 14.21 until the end of this section. 

Since the boundary of H has zero Lebesgue measure, Portmanteau's Lemma and the first 
part of Lemma [4. II give us that \i d = 1, (using the obvious generahsation to A^ of our previous 
notation) 

hm / dxi... dxj ^{xi, . . . ,Xj)E[lHiCln) ' ' ' 1/^fe^*") | ^o = i,^o = xi^/n, ...,^q = Xjy/n] 



hm 



dxi... dxj V'(xi, . . .,Xj)E[1h{C )---'^h{C, 



^n^NV--. 



\m 






dxi . . . dxj ipixi, . . . , Xj)'E[1h{^- 



t ) 



^oo,N, 



i^(C"*)|iVo^ = j,e, 



oo,l 




Xi, 



Xi, 



'SO 






ooj 



Now, Theorem 4.1 in |Eva97j guarantees that there exists a unique H-valued Markov process 
starting from (the equivalence class of) Ih^x) and dual to A°° through the relations ([8]). Let us 
call this process p^"^' . Using the more compact notation of ^3.11 we obtain that for every j G N 

and V e C{{R'^y) n L\dx'^^), 



\imE[l,ip^;i;)]=E[l,ip\'>;i^)]. 



(2). 



Since this family of test functions in dense in C(H) (c.f. ^3.ip . we can conclude that p^ ^ p\ as 
n — )• cxD. It is then straightforward to check that the conditions of Lemma 13.21 (i) are satisfied, 
and so for a.e. x G M'^, w^'^'{t,x) is a Bernoulli random variable with parameter 



P. [^r G ^] = F. [X^^t G ^] = p\t, x). 

Moreover, by Lemma |4. II and (l9|), the correlations between the values of w^'^'{t, 
sites can be described as follows. For every j G N and Lebesgue-a.e. (xi, . . . , Xj), 



at different 



E[w^^^ {t,xi)...w^^^'>{t,Xj)' 



E 



w 



(2) 



(o,er'')---..(^)(o,C"*) 



N^ 



J' ?0 



Xl, 



'SO 



C"gF, V.G{l,...,iV-} 



N^ 



J' ?0 



Xi, 



too J 
'SO 



(15) 



Since we are dealing with Bernoulli random variables, equation (jlSp completely characterizes 
these correlations. 
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If d >2, by the same chain of arguments (using this time Lemma l4.2p . we obtain 



Hm 

n— >oo 



/ dxi... dxj Tp{xi, . . .,xj)E[1h{S.L) ■ ■ ■ l-f^(Ct^*") I ^0 = j,Co = xiy/n, . . . ,Co = XjVn] 
J{Rdy 

= [ dxi... dx, V(xi , . . . , xj)¥^, [C' G ii-] • • • Px, [er'' e H] . 
J{R<'y 

Here again, these equahties guarantee the convergence in law of p" towards the value at time t 
of the unique H-valued Markov process p^"^' starting from 1h{x) and dual to the system A°° of 
independent Brownian motions which never coalesce. Lemma 13.21 (ii) then applies and gives us 
that for a.e. x G M , w^'^'{t,x) is the deterministic constant p'^{t,x). 

So far, we have obtained the desired convergence at a given time t > 0, and the form of the 
local densities of I's in the limit. It remains to show that the convergence holds true for finitely 
many times < ti < ■ ■ ■ < t^. Because functions of the form Ij{- ; ip) are dense in C(H), we 
need only show that for every ji , . . . , jk and ipi, . . . ,tpk, 



lim E 

n— >oo 



Il^M.■^^^) 



i=l 



E 



n^i^i/^r'^^ 



i=l 
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Therefore, let us fix ji, . . . ,jk and ■01, • • • , V'fc such that ■0j G C{{R'^y^) n L^{dx®^^). To simplify 
notation, we write x* for the vector (x^,...,x*.) and W^{x^) for the product H/li '"^"(^J'^D- 
We will occasionally abuse notations and write x* U x^ for the concatenation of the vectors x* 
and x^ . Our strategy is to use duality again, but now with the genealogical process described in 
the second part of Lemmas 14.11 and 14.21 Once again, to simplify our notation, let us denote the 
law of A"' (resp., A^) starting from j lineages at locations x = (xi, . . . ,Xj) by P" (resp., P"f). 
Hence recall that P^ is a distribution over Um>o'{'^} ^ (M*^)™. Using the Markov property of w 
at time tfc_in and the duality property ([8]), we can write 



E 



X{ln{pl-^i^'^ 



i=l 



dx} ■■■ dx!" ipiix^)--- ipkix'') 



E 



k-2 



XE 



fc-1 



«=1 



U^n^i') K^ w-itk.^,c:-t,J■■■^''(tk-l,i 






y . . . y dxi • • •dx'^ Mx") ■ --Mx^) j d¥l>.,t,-t,_, K_i,yt"\ 






J 



'- ^ j=i 



U^rix') w^{t,_^,xt')■■■w^{tk^,,x';;}Jw^{tk^^,yt')■■■ t/;"(tfc_i, y^.^J 



Since the law of the locations at time tk — tfc-i of the N"^_f^__^ lineages is absolutely continuous 
with respect to Lebesgue measure, we can carry on the recursion and use the Markov property 
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(this time at time tfc_2) and duality to write the quantity above as 



X / dP^fe_i^^fc_i^^^^_^_^^_^, 



{mk-2,yi ^...,yi,!jE 



fc-3 



ll^^^^') 



i=l 



xw^{tk-2,x',-^)... w^{tk-2,x';;_l)w^{tk_2,yt')--- w^{tk-2,yit.) 



dx}--- dx^ iPiix^) ■ ■ ■ V^fc(x^) / dP^fc^i^_t^_^ (mfc_i,yi^ \ . . • ,y^;ij 



^iPIw,*iK'y?'---'y-o)Ek(o,y?)---^"(o,y^o) 



(17) 



Now, recall the family of processes J^ introduced in the second part of Lemmas 14.11 and 14.21 
Let us denote the times of appearance and the locations of the additional lineages in the form 
(''"1)1"^)) • ■ ■ J i'Tkjz!'). Using (recursively) the Markov property of ^", we obtain that the quantity 
on the right-hand side of p!7|) is equal to 



dx} ■■■ dx^ ipi{x}) ■ ■ ■ ipkix^) 



xE 



w 






(0,x'=), {tk - tfc-i,x^ ^), . . . , (ifc - ti,x}) 



Let us now conclude when d = 1 (the reasoning is exactly the same when d >2). Recall that for 
every n G N, ^"(0, •) = Ih(-) = w^'^'{0, •). By the second part of Lemma [4. II and the Dominated 
Convergence Theorem (and the fact that the boundary of H has zero Lebesgue measure), we 
obtain that 



lim E 

n— >oo 



dx} ■■■ dJ} '4)i{x}) ■ ■ ■ ipkiJ") 



xE 



Analogous calculations using the duality between A°° and p'^-* lead to (J16p . This completes the 
proof of Theorem 11.11 D 

It remains to prove Lemmas 14.11 and 14.21 Let us start with the latter, which is somewhat 
simpler, but contains the main ingredients of both proofs. 

Proof of Lemma 14. 2L Let xi,...,Xk be k distinct points of M . Suppose that A^ starts 
from k lineages at locations xi-^/n, . . . ,Xk\/n. First, since a single lineage ^ follows a finite-rate 
homogeneous jump process whose jumps are uniformly bounded by 2r, standard arguments 
guarantee that ^" = {n^^''^Ctn)t>o converges in distribution to Brownian motion with clock 
speed (T^ given in dT]). 

Second, observe that two lineages can be hit by the same event (and possibly coalesce) only 
if they lie at distance at most 2r of each other. Consequently, as long as they are at distance 
greater than 2r they evolve independently, according to the law of the motion of a single lineage. 
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Hence, let us define riTn to be the first time at wliich at least two of tlie k initial lineages are 
within distance at most 2r of one another. Equivalently, Tn is the first time at which at least 
two lineages of A^ are at separation at most 2r/y/n. We wish to show that for any t > 0, 
FxK < t] ^ as n ^ oo. 

To this end, note that until time x„, the motions of the rescaled lineages ^"''^, . . . ,^"'''^ can 
be embedded in the paths of independent standard Brownian motions X^, . . . ,X^ starting from 
xi, . . . , Xfc (we use the same Brownian motions for all n). Indeed, for each path i we proceed as 
follows (this construction is in the spirit of the one-dimensional Skorokhod Embedding Theorem, 
see e.g. [Bil95] ) . Let (-R?'*)j>i be a sequence of i.i.d. random variables (independent of X*) 
distributed according to the law of the radius of a typical jump of ^", and let us define a sequence 
{sfj,j > 0} of random times, recursively, by 

1. <o ■■= 0, 

2. for every j > 1, s" ■ is the first time greater than s"_j^ at which X* exits the ball 
B{Xl„ ,RY). 

By rotational symmetry of the law of a jump of c^'^'*, conditional on its radius being 7 the 
location of 1^"'* just after the jump is uniformly distributed over the sphere 0S(^^*, 7). Likewise, 
conditional on the variable i?"'* being equal to 7, the location of Xln is uniformly distributed 
over dB{Xln ,7). Consequently, by comparing their jump rates and their jump distributions, 
one can show that for every i G {1, . . . ,k] the processes (C"'*)t>o and (X*„ ) have the 

same laws, where {j{n,i,t))t>o is a Poisson process with intensity nuVr (recall from ([1]) that 
uVr is the jump rate of an unrescaled lineage under the conditions of Case A, where Vr is the 
volume of a ball of radius r). Since the lineages ^"'^, . . . ,'^"'-' evolve independently until time 
Tn, we can ask that the Poisson processes {j{n, 1, •),... ,j{n, k, •)} should be independent and 
the embedding holds for all i S {1, . . . , A;} simultaneously until the first time t such that 

\Xlu - X^ I < 2r/^/n for some i / m. 

' i,j(n,i,t) m,j{n,m,t) ' 

Now, each rescaled lineage makes jumps of size at most 2rn~^'^ at rate 0{n). Hence, each 
difference sf, — s"._i is the exit time of Brownian motion from a ball of radius 0(n~^''^), and 
s" •/„ i t/\T ) ^^ ^^^ ^^™ '-'^ (morally) 0{n) such quantities, all independent of one another. More 
formally, if we write R for the (random) radius of a typical jump of an unrescaled lineage and 
if we notice that the exit time of Brownian motion starting at from a ball -6(0,7) is bounded 
by the first time that one of its coordinates leaves the interval [—7,7], then for all n G N and 
all 1 < i < /c we can write 

21 d „r„oi . 4:dr'^ 



E[sl^] <dE {Rl'y =-E[R^] < 



n ■- -■ n 



where the first inequality uses the property that the exit time from [—7, 7] of one-dimensional 
Brownian motion starting at has expectation 7^. By the independence of Xi and the Poisson 
processes, this yields that for all n and i, 

EK.Xn.MAr^)] =IE[j(n,i,tAr„)].E[.-i] <MuVy. 

To conclude our proof, let us observe that P"[t„ < t] is bounded by the probability that 
at least two of the k independent Brownian motions X^ , . . . ,X come within distance 2rn^^'^ 
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before time min{s" ■(^^^\, ^ — ''■ — ^}- ^^^^ ^^ ^^i denotes the first time at which two independent 
Brownian motions starting at xi 7^ X2 come within distance 2rn~^'^, for every T > we have 

Hm P^.,,^2 [rn <T]=0. 

Hence, the probabihty that at least two out of k independent Brownian motions come within 
distance 2rn~^''^ before any given time T also tends to 0, and thanks to the uniform bound on 
the expectation of s^j/nHAr ) (together with the Markov inequality), it is straightforward to 
obtain that for any t > 

lim P^ Wn < tl = 0. 

We have thus shown that with probability growing to 1 as n — ?■ 00, until a given time t > the k 
ancestral lineages evolve as if they were independent. Since the law of each ^"■'* converges to that 
of Brownian motion with clock speed o"^, the convergence of the one-dimensional distributions 
of A^ to those of a collection of k independent Brownian motions is proved. 

The proofs of the convergence of the finite-dimensional distributions and that of the second 
part of Lemma 14.21 follow the same lines, using the Markov property of each A'^ at suitable 
times. Details are left to the reader. D 

Proof of Lenima l4.1l Once again we start with the one-dimensional distributions, and proceed 
by recursion on the number m of lineages of A"^. As in the proof of Lemma [4.21 before rescaling 
each lineage follows a homogeneous symmetric (finite rate) jump process, whose jumps have 
length at most 2r, and so ^" = {n~^''^^nt)t>o converges in distribution to Brownian motion with 
clock speed a'^ as n tends to infinity. 

Let us consider the case m = 2. As we saw in the proof of Lemma 14. 2^ the two rescaled 
lineages evolve independently until they come within distance 2rn~^'^ of one another. Let us first 
show that this 'meeting' time converges to the meeting time (at distance 0) of two independent 
Brownian motions starting at xi and X2 and with clock speed a"^, and secondly that coalescence 
is quasi-instantaneous once the lineages are gathered at this distance. 

For the first claim, let us write t„ for the time at which ^"'^ and ^"'^ first come within 
distance at most 4rn~^'^ of one another (note the constant 4 instead of 2, which we shall need 
later for purely technical reasons). Because the motion of a single lineage is a symmetric jump 
process, until r^ the law of the difference ^'^'^ — ^"'^ is the same as that of the motion of a 
single rescaled lineage, run at speed 2. Let X be a standard one-dimensional Brownian motion, 
starting from xi — X2 and independent of all .^"'s. Using anew the construction introduced in the 
proof of Lemma 14.21 for every n we can find a sequence of random times {s'^,j > 0} such that 

iCt^' ~ Q' )t>o has the same law as (Xs" )t>Oi where j(n, •) is a Poisson process, independent 
of X and with intensity Anru (that is, twice the jump rate of a single rescaled lineage). Recall 
from the proof of Lemma 14.21 that for every n G N, the random variables s^ — s'^_^^ j > 1, are 
i.i.d and if R is distributed like the radius of a typical jump of ^, we have E[ns"] = E[i?^] < 00. 
Let t > 0, and, as a first step, let us show that s"/ ,-. converges in probability towards 2cr^t 
as n grows to infinity. The second step will then consist of proving that, for every t > 0, the 
probability that r„ > t tends to the probability that the hitting time of by X is greater than 
2o"^t. This will give us the desired result. 

By definition, j{n,t) is a Poisson random variable with parameter (4nMr)t. By the Central 
Limit Theorem, we therefore have that 

n^^/^{j{n,t)-Anurt) -^ J\f{0,4urt). (18) 



20 



Now, recalling the properties of the s" — s^_iS expounded above, by the Strong Law of Large 
Numbers we have 



' [4niiriJ 



[Anurt] 
n ^ — ^ ^ 



n 



i) -^ 4urt X E[R^] as n — )- oo, 



(19) 



i=l 



where [z\ denotes the integer part of z. But cj^ is defined in ([7]) as the variance of the displace- 
ment at time 1 of a single unrescaled lineage, and so 

(j2 = 2urE[R^], 

which shows that the limit in (fT9]) is equal to 2cr^t. To conclude the first step, observe that 
l^?(n t) ~ ^\^Anurt\ I ^^ ^^^ ^^™ °^ lj("'' ^) ~ L^'^'W^J I i-i-d- terms of the form sf — sf_i, all of them 
independent of j{n, t), so that for every e > and every n > 1 we have 

„3/4 



'[ki(n,t) - ^l^nurti \ > ^] < lP[lj(n, *) " ^nurt\ > n^/^] + P 



E (^r - «?-i) > e 



i=l 



As 71 — )• cxD, the first term on the right-hand side tends to by (jlSp . while Markov's inequality 
gives us that 



n3/4 

E 



>r - ^^i) > e 



< - E 
e 



n3/4 

E(^" 



" «r-i) 



c 



en 



1/4 



Since this is true for any e > 0, s'^,^-, — sV^^^^^, converges in probability to 0. But we have 
shown that sV^^^^^, converges a.s. to 2a'^t, and so we obtain that s". ^. converges in probability 
to 2c7^t, as required. 

As explained above, we can now use this result to show that t„ converges in distribution to 
the hitting time of by {X2fj2t)t>o- Indeed, by construction of the random times sf and the 
fact that the rescaled jumps of a lineage are bounded by 2r/y/n, for any i > 1 the Brownian 
motion X cannot move to a distance greater than 2r/y/n from X^" before time sf. Thus, if 
To denotes the hitting time of by X, we have 

V^,-X2[rn >t]< r^,-^^ [to > Sj"(„^jJ . 

But we showed that s'^(nt) converges in probability towards 2a'^t as n — )■ oo, and so 

lim sup F^,_^, [t„ > i] < P^,_^.2 [to > 2ah] . (20) 

n— >oo 

On the other hand, for every e £ (0, |xi — X2I/2) and every n large enough, we can write 

Fxi-xaK > t]>F^^^^^[X does not enter B{0,Ar/^/7i) before s^(„^^)] 
> Pa;i--Z2 [^ does not enter B{0, e) before s"/^ ^J . 

Again, we can deduce from the convergence in probability of s"/ ,n to 2a^t that 

liminf Fx-^-x2['Tn > *] ^ IPa:i-z2 \^ does not enter 5(0, e) before 2a t\. 
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This inequality holds for every small e > 0, and by the point recurrence of one-dimensional 
Brownian motion, we can conclude that 



liminf ¥.^^^^^[Tn >t\> 



X\-X2 



[TQ>2aH]. 



Together with ()20p . we obtain that for every t > 

lim ¥^^^^.^[Tn >t]= P^,_^2 h > 2cj2t] 



(21) 



(22) 



from which we can conclude that the 'meeting time at distance 4r/-^/n' of two rescaled lineages 
starting at xi and X2 converges in distribution to the hitting time of by Brownian motion with 
clock speed 2o"^, or equivalently to the meeting time of 2 independent Brownian motions each 
of clock speed a"^ . 

Let us now prove our second claim; that is, let us show that once at distance at most 
Ar/^/n, the additional time the two lineages need to merge becomes negligible as n tends to 
infinity. Because the proof is highly reminiscent of that of Proposition 6.4(6) in [BEVlOj . we 
only outline the main steps here. Let us work with the unrescaled lineages, and suppose they 
start at distance at most 4r of each other. First, it is not difficult to convince oneself that the 
first time at which the two lineages are at separation less than 2r is of order 0(1), 'uniformly' 
over all initial locations which are at separation at most 4r. Once close together, they become 
correlated, because they can be hit by the same reproduction event. But for the same reason, 
they have a positive probability of being affected by the same event and of coalescing before 
separating again to distance at least 2r. If they do coalesce, the additional time they had to 
wait for this event is also of order 0{1). If they separate rather than coalescing, then again the 
time they need to come back to separation less than 2r is of order 0(1), and once 'gathered' 
they have a positive chance to coalesce before separating, and so on. In the end, the number of 
excursions of S} — ^^ out of -6(0, 2r) before the two lineages merge can be stochastically bounded 
by a geometric random variable, and each of the finitely many excursions and incursions lasts a 
time of order 0(1). This tells us that for every e > 0, one can find T{e) > such that 



sup 

\yi-y2\<4:r 



(2/1:2/2) 



[^ and ^ do not coalesce before T(e)] < e. 



Rephrasing the above inequality in terms of the rescaled lineages, we obtain that, for every 
n> 1, 

sup ^{zuz2) [^"'^ and C'^ do not coalesce before T{e)/n] < e. (23) 

|2i-22|<4r/v^ 

Finally, if r^ denotes the coalescence time of ^"'^ and ^"'^, using the strong Markov property of 
^^n,i^^n,2-j ^^ time r„, we have, for every i > 0, 



{xi,X2) Vn 



>t\= ^(xi,X2) l{r„<oo}IP(g-^\^;?^2^[Tn > A 



By (|23|) . the probability inside the expectation tends to as n — )• oo, and so does the quantity on 
the left-hand side (by dominated convergence). Hence, t'^ — t^ converges to in probability. This 
concludes the proof of the first part of Lemma l4.ll when m = 2: in the limit, the two lineages 
follow independent Brownian motions run at clock speed o"^ until the first time at which they 
meet, which is also the time at which they coalesce by the convergence of r^ — r^ to 0. 

We now proceed by induction. Suppose we know that the result of Lemma 14.11 holds true 
for a system of ?ti — 1 lineages. Let xi < . . . < Xm be m distinct points of M and suppose that m 
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lineages start from these locations. Because the lineages 'choose' to take part in an event that 
encompasses them independently of one another, the law of the restriction of the system started 
from m lineages to that started from m — 1 lineages at xi, . . . , Xm-i is the same as that of the 
{m — l)-system starting from xi, . . . , Xm-i- (This is the 'consistency' of the genealogical process 
described below Lemma [32]). Hence, our inductive hypothesis tells us that the restricted process 
converges to a system of (initially) m—1 independent Brownian motions with clock speed o"^, that 
coalesce instantaneously upon meeting. Now, as we explained several times already, the motion 
of the m-th lineage, starting at the right-most location Xm, is independent of that of the others 
until the first time, t„, at which it comes to within distance 2r j y/n of another lineage. But with 
probability tending to 1, the right-most lineage among those that started from xi,...,Xm-i 
is the lineage ancestral to the individual sampled in x^a-x- Indeed, our inductive hypothesis 
guarantees that the probability that the lineage starting from Xm~i jumps over a lineage on its 
left without coalescing with it tends to as n tends to infinity. Again by consistency of the 
genealogical process, when singled out, the motion of lineage m — 1 has the same law as the 
process ^" (that is, a typical single lineage), and so we can focus on the two right-most lineages 
and use the results obtained for m = 2 to conclude: their meeting time at distance at most 
At I ^fn converges in distribution to the meeting time of two independent Brownian motions run 
at clock speed a^ ^ and in the limit this meeting time is also the coalescence time of the two 
lineages. But this is precisely the evolution of a system of (initially) m independent Brownian 
motions which coalesce instantaneously when they meet, and so the desired convergence also 
holds for a system starting with m lineages. 

As in the proof of Lemma 14.21 the other points of Lemma 14.11 are obtained by using the 
convergence of the one-dimensional distributions and the Markov property at suitable times. D 

5 Heavy-tailed case 

In this section, we prove Theorem 11.51 and give some properties of the limiting genealogical 
process, which are of independent interest. Recall that the fraction of individuals affected by an 
event is set constant, equal to u G (0, 1], and the radii of the events are sampled according to 
the intensity measure 

^(dr)=r-°-'^-il|,>i}dr, 

where d is the dimension of the geographical space. 

As in the proof of Theorem 11.11 due to the duality relations ([8]) we need only establish 
the asymptotic behaviour of the rescaled genealogical process (^")i>o of a finite sample of 
individuals, defined in our previous notation by 

^r = [Q\ . . . , i^^) ■■= [n-^'-ent, . . . , n-^/^eo . 

In words, we speed up time by a factor n and scale down the spatial locations of the lineages 
by n^'". Indeed, if we can show that the finite-dimensional distributions of A^ converge to 
those of a system of coalescing processes A°^ that has sufficiently nice properties (i.e., which 
can be used to construct a dual H-valued process p^^' using the technique of |Eva97| ) . then 
the same arguments as those used in the proof of Theorem 11.11 will grant us the convergence 
of the finite-dimensional distributions of p" to those of /)'"' . Then it will remain to show that 
A°° satisfies the conditions of Lemma [3.2( z) to obtain the desired form for the local densities of 
I's, w^'^'{t,x), and to use ^ to characterize the correlations between these Bernoulli random 
variables. Hence, the crucial step is to prove the following proposition. 
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Proposition 5.1. There exists a system A°° of coalescing symmetric a-stable Levy processes 
such that 

A"-^A°°, as n -^ oo, 

in the sense of weak convergence of the finite- dimensional distributions. Moreover, if we define 
the process AP" and A°° in an analogous way to the corresponding processes in Lemmas \4-l\ and 
\4-^ we also have convergence of the one- dimensional distributions of A^ to those ofA"^. 



Proof of Proposition 15. IL Our aim is to write down the generator Q"' of A", and to show 
that it converges to the generator of a system of coalescing symmetric a-stable processes. Up to 
now, we were able to be rather vague about the precise representation of the ancestral lineages, 
but in order to write down a sensible generator we now need to be more precise. Suppose we 
start with k lineages. The system at any time t > is represented by a marked partition of 
{1, . . . ,k}. Each block of A"' contains the labels of all individuals in the initial sample which 
have the same ancestor at time t in the past (that is, whose ancestral lineages merged before t), 
and the mark associated to the block gives the spatial location of this ancestor at time t. 

Since only the lineages present in the area hit by an event can be affected by this event, for 
every y G M , r > and every marked partition A let us write J{y, r, A) for the set of indices of 
lineages (blocks) of A whose mark belongs to B{y, r) (to index the blocks of A, we rank them in 
increasing order of the smallest label that each contains). For convenience, we shall also use the 
notation Jn{y,r,A) := J{n~^''^y,n~^'°'r,A). Next, if A contains m blocks and I C {1, . . . ,m}, 
then for every z G M we write ^i{A, z) for the marked partition obtained by merging all blocks 
of A indexed hy i G I and by assigning the mark z to this new block (the other blocks and 
marks remain unchanged). For instance, if ^ = {({1, 5},xi), ({2,3},X2), ({4, 6}, 3:3), ({7}, 3:4)} 
and I = {1,4}, then 

<^i{A,z) = {({l,5,7},z),({2,3},X2),({4,6},X3)}. 

Finally, we write |/| for the cardinality of the set I, and we recall that Vr denotes the volume of 
a ball of radius r. 

Because lineages jump and merge at finite rate, the generator Q of the system of unrescaled 
lineages {At)t>o can be expressed as follows. For every bounded measurable function / and 
every marked partition A (of some finite set {1, . . . , A;}), 

gf{A)= [ dy Tfiidr) [ ^ Yl ^'"(l-^)'-'^"[/(^/(A^))-/(vl)], (24) 

Jr^ Jo JB{v,r) Vr j^j^y^,^^^ 

where in the above and what follows we write J as a shorthand notation for J{y, r, A). Indeed, 
if an event occurs in B{y,r) and the parent is chosen at location z, then every lineage present 
in this area is affected by the event with probability u, independently of each other, and all 
lineages that are affected merge and jump onto the location z of their parent. 

Mutiplying time by n and marks by n~^'", we obtain from the expression in (j24p that the 
generator of A!^ is given, for every / and A as above, by 

gy{A)=n [ dy r Kdr) f ^ Yl ^'"(1 " ^^)'-'^" [/(^/(A^-^/^z)) - /(^)] . 

7m^ Jo JBiy.r) Vr j^j^^y^,^^) 

To see where the sum comes from, observe that an unrescaled mark belongs to B{y,r) iff its 
rescaled version belongs to B{n^^'°'y,n~^'°'r), and that the affected (rescaled) lineages jump 
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onto n ^'"z when their unrescaled counterparts jump to z. Making the change of variables 
z' = n~^i'^z, and then y' = n~^'°'y and r' = n~^'°'r, we obtain that G"'{A) is equal to 



1+^ 
n « 



dyT^J V E u\\l-ur\\fmA,z))-f{A)\ 

= b'r.^f..V ^ ul^l-ur^l[fmA,z))-fiA)] 

jRd Jn-l/c. r JB{y,r) Vr j^j^^y^^^j^^ 

^L/'Tu^f^.V ^ uil-uri-^[m,,,iA,z))-fiA)]. (25) 

Let us define S{A) as half of the minimal pairwise distance between marks in A {6(A) := -|-oo if 
A contains only one block), and let us show that for every A such that S(A) > and every / 
compactly supported and of class C^ with respect to the marks, G"-f{A) converges as n — )• cxd 
towards the quantity Q°'f{A) defined by 

g^f{A) 



7Rd Jo r JBiy,r) Vr ,^^(^^,^^)^|,|>2 



j=i 



1^1 



^x|J(y,r,A)|-l 



dz 

- [f{^{,yiA, z)) - f{A) -{z- x„ V./(A))l||,_,,|<i}] 

B{y,r) Vr 



^^-1^ JRd JO r ^ ^ J B{y,r) ^r 

(26) 

where | A| denotes the number of blocks of A, Xi is the mark of the z-th block, V,/ is the gradient 
of / with respect to Xi and (•, •) is the scalar product in M*^. 

We shall comment on the different terms of Q"' f{A) later. For now, let us show the desired 
convergence, as well as the finiteness of 0°^ f{A). Let us start with the first term on the right- 
hand side of (|25p. By definition of 5{A), a ball of radius r < S{A) cannot contain more than 1 
lineage (mark), so that the integral over r runs in fact from n~^'°' V 6{A) to +cxd. For n large 
enough, this first term is thus equal to 

Jr^ JsiA) r-+^^' JBiy,r) K j^j^yf;Xw\>2 

and so is the first term ofG°'f{A). Since u £ (0, 1], / is bounded, the sum over I is finite and since 
any event location B{y, r) must intersect the compact support of / to have a nonzero contribution 
to the generator (so that we may restrict the integral over y to some ball B(0,r + A(/)) with 
A(/) depending only on /), there exists a constant C(/) > 0, independent of A, such that the 
absolute value of the first term of G"f{A) is bounded by 

fOO 1 

J5{A) r"+^+i 
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Now consider the second term on the right-hand side of ()25p . Let us spht it once again into 
/" f°° dr f dz v-^ 

^"' ' tiEJ{y,r,A) 

X [/($|,}(A z)) - f{A) -{z- x„ V,/(A))l||,_,^|<i}] (28) 

We rewrite X^jgjr^^.^) as YlUi l{xiGB{j/,r)}) and, for n large enough, we spht the integral over 
r G [n~^'",oo) in (j29p into the integral over [n~^''^,6{A)) and that over [6{A), oo). The second 
integral is finite for the same reasons as in (I27p . On the other hand, if r < S{A) then J{y, r,A)<l 
for every y, and so the first integral is equal to 

^ f /■'^(^) dr f dz 

2^ dy ^^l^^^^^^y^^,^y / (^_a;.,Vi/(^))l||,_,^l<i| 

= ^Z^/ ,, T/^a+d+i / , dz / dy l||,^_y|<,|l||,_y|<,|(z-x„V,/(A)) 

= ^E/ ,, TFT^JTdTT/, dz(Vol(i?(x.,r)nS(.,r)))(.-x„V./(A)), 

and, by symmetry, the integral over z is equal to for every r. The integral in (j29p is thus equal 
to 

1^1 



« 



~^JR<* J(5(A) r ^ ^ JB{y,r) ^r 

and if we decompose the range (0, cxd) over which we integrate r in the third term of Q'^j{^A) 
into (0,(5(A)) and [5(74),oo), we find that the integral over the latter is equal to the quantity 
above. 

Finally, let us show that (|28p converges to the second term of ^"/(A). This time, we split 
(1281) into 



1^1 
u 



Y.f/yr, ''T°?i:"'"' "-")''"'""""/, ,f{/(*«(A^))-/(A))i,i„,M, 

i=l -^^ Jn~^/°' r ^ ^ JB{y,r) ^r 

\A\ 



X / IT {fi'^{^M^ ^)) - /(^) - (^ - ^«> V,,/(^))) 1{|.-.,|<1}. 

JB{y,r) ^r 



The first term is finite for the same reasons as in (j27p . since for the parent to be at distance 
greater than 1 from the affected lineage, one must have r > 1/2. Now, using the same steps as 
above, we obtain that the second term is equal to 

n V / dz r TT^^ I dy{l- ^)l^fe.^'.^)l-i 

^ Jb{x,,1) Jn'^/^V^-^ K-r^+'^+l JB(z,r)nB{x,,r) 

X {fi^{,y{A,z))-f{A)-{z-Xi,VifiA))). (30) 
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But / is of class C^ and has compact support, and so we can find a constant C{f) > 0, 
independent of A, such that for every i and every z G B{xi,l), 

\f{^{,yiA,z)) - f{A) -{z- x„VaiA))\ < C{f)\z - x,\\ 

As a consequence, the absolute value of the quantity in (I30p is bounded by 



wCV [ dz r dr Yo\{Biz,r)nBixi,r)) 



Z Xi 



<uC'\A\ / dz |zp(n-i/"V(|z|/2)) 

JBiO,!) 



— a—d 



= uC'\A\ I n^+^ / dz |z|2 + 2"+^ / dz Izl^-"--^ \ 

y JB(0,2n-'^/°') JB(0,l)\-B{0,2n-i/<i) J 

<C"\A\{n-'-^+C"'{l-n-'-^)}, (31) 

where all the constants appearing in this bound depend on /, d and a, but not on A. Since 
a < 2, (pOj) remains bounded as n — )■ oo and ([28]) converges to 



Ml 
~{ JW^ Jo r ^ ^ JB{y,r) ^r 

- {z- Xi, Vi/(^))l{|^_^^|<i}] , 

which is precisely the second term of Q"f{A) (and is finite according to the analysis above). 
Tracing back our calculations, we see that for n large enough (such that n~^'" < 6{A)) the 
difference between Q'^f(A) and Q°'f{A) is equal to the difference between the quantity in (j30p 
and its counterpart in Q"'f{A) (that is, the second term of Q"f{A) in which y is only integrated 
over B{xi, 1)). Hence, according to ([3T]) . for every n > 5{A)~'^ 

\g-f{A)-g"fiA)\<Cf\A\n-'-^, 

where the constant Cf is again independent of A. Consequently, for every / which is compactly 

supported and of class C^ with respect to the marks, the function g°'f is bounded and the 

convergence 

lim sup |g"/(A)-g"/(A)| =0 (32) 

"■^°° 5(A)>e,\A\<k 

holds for any choice of e > and A; G N. 

To conclude the proof of Proposition 15.11 let us use the following result, whose proof we 
postpone for the sake of clarity. For every e > 0, let t^ be the first time at which at least two 
lineages lie at distance less than e > without having coalesced. 

Lemma 5.2. For every initial value Aq such that 6{Aq) > 0, we have 

limPAnfte <ool = 0. (33) 

£-S>0 

As a consequence, the martingale problem associated to {g°',Ao) has a unique solution (with 
cddldg paths) for any initial value Aq satisfying 6{Aq) > 0. Let us denote this solution by A°° . 
Then A°° is a consistent system of coalescing symmetric a-stable processes. 
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Let us suppose that Lemma 15.21 has been estabhshed, and verify that the conditions of 
Theorem 4.8.2(6) of |EK86) are then fulfilled. First, one can check that the set of functions / 
considered above is dense in the set of all bounded continuous functions on marked partitions. 
We can thus restrict our attention to these particular functions. Second, (p3|) enables us to use 
(j32p and dominated convergence to obtain that Condition (8.7) of Theorem 4.8.2(6) of [EK86J 
is satisfied, and consequently that the finite-dimensional distributions of A"^ converge weakly to 
those of A°° as n tends to infinity. The arguments for the convergence of the one-dimensional 
distributions of ^" are the same as in the case with fixed radii, and so the proof of Proposition l5.ll 
is now complete. D 

Before proving Lemma [5^ let us study some of properties of the 'genealogical' process A°°. 
Indeed, in order to use Lemma l3.2r i). we need to show that (jlOp holds. In fact we can be more 
precise about the way coalescence occurs. 

Lemma 5.3. Sample two individuals at separation x, and consider their ancestral lineages 
(Xt,t>0), {Yt,t>Q). Let 

T = inf{t >0:Xs = Ys for all s > t} 

be their coalescence time. Then r < oo almost surely, and moreover, there exists a random 
variable Z, a.s. finite and independent of x, such that 

T ^ x'^Z, (34) 

where -< stands for stochastic domination. 

Proof of Lemma 15.31 In essence, the strategy of the proof consists of showing that if the two 
lineages start at distance a > 0, they have some positive chance (independent of a) of coalescing 
before they either separate to a distance greater than 2a or come within distance less than a/2 
of each other. The dependence on x" in the lemma then comes from the fact that the time 
needed to coalesce, or separate, or get closer by a factor of 2, is of the order of x°' when the 
initial separation is x. 

By translation invariance, we may assume without loss of generality that the origin of M sits 
at the midpoint between Xq and Yq. Let T{x) be the first time that any point in B := B[0,x) 
is touched by an event whose radius r is greater than x/4. Then we claim that T{x) is an 
exponential random variable whose rate A(x) is given for every a; > by 



\{x) = J^^-^^^^^Yol{B{0,x + i)). (35) 

Indeed, recall the intensity measure ([3|) we introduced before rescaling the process. In the 
original units of time and space, the rate at which any point of the closed ball B{0,x) (x > 4) 
is hit by an event of radius greater than x/4 is given by 

/■ f°° di f°° di 

L ^' U ^^^^- li^(MnB(.,.)^0} = j^^^ j,^^ Vol(i?(0, X + £)). 

Multiplying this rate by n and looking at distances of the form xn^'", a simple change of 
variables gives us that for every x > 4n~^'°, the rescaled rate of interest is also equal to the 
expression above, independently of n. Passing to the limit n — >■ oo yields ([35]) . 
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Now, setting £ = rx we can write 

/■°° dr 
= ^""/ ;:dTTT^Vol(i?(0,l + r)) = C7x-", (36) 



'1/4 

where the constant C is independent of x. 

On the other hand, similar calculations enable us to see that the rate at which B is entirely 
contained within the area B{z,r) of an event is given by 

p d£ f°° d£ f 



fJ^''-mO,r-l)) = C'x- 



where we used the same change of variable as before and C" > is again independent of x. As 
a consequence, with probability pQ := C /C independent of x, the first event of radius greater 
than x/4 that hits at least one point of B actually covers the whole ball. Moreover, (j36l) also 
implies that for arbitrary q > 1/4, the radius R{x) of the event occurring at time T{x) satisfies 

F{R{x) > qx) < cq-"", (37) 

for some constant c which does not depend on x or q. 

Let X,Y he the motion of the lineages as governed by all the events except those that affect 
some point in B and whose radius is greater than x/i. Then by the Poisson point process 
formulation of the reproduction events, T{x) is independent of X,Y and {Xt,Yt,t < T{x)) 
coincides with {Xt,Yt,t < T{x)). Let S{x) := mi{t > : Dt < x/2 or Xt (^ B or Yt ^ B}, 
where Dt = \Xt — Yt\. Fix 6 > 0, and define the following events: 

E := {T{x) < fe"}, F := {S{x) > fe"}. 

Then E and F are independent, and by ([36]) there exists p{6) > such that P(£') = p{6) for 
all X > 0. A similar property holds for F. Indeed, note first that up until the time S{x), the 
trajectories X and Y are independent, since the trajectories can only move as a result of events 
occurring in necessarily disjoint regions of space. Moreover, it is easy to check that 

-^tx°'AS(x), -Ytx°'/\,s(x)) ^^ (38) 

X ^ ' X ^ '/t>0 

has the same distribution as the pair (^tA5(i))^M5(i))t>o obtained by taking x = 1: both 
coordinates of this process perform independent stable Levy processes where each jump greater 
than 1/4 occurring in -6(0, 1) is removed, and the process is stopped when either coordinate 
leaves -6(0, 1) or they come within distance 1/2 of one another. (A formal proof is given by 
comparing the generators; the generator of the pair (X, Y) is the same as Q'^f{A) when A has 
two blocks, as defined in (j26p . but with the first term equal to 0). 

Hence for ah x > 0, F{S{x) > fe") = P(5'(l) > 6) =: q{5), and q{6) > whenever 6 is 
chosen small enough. 

Let us denote the centre, radius and impact parameter of the event taking place at time 
T{x) by {Z (x) , R{x) , u) . We shall say that a success occurs if both E and F occur, and if 
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(a) B{0,x) C B{Z{x),R{x)), 

(b) both Xq-M , Yj-M are affected by the event occurring at time T(x) (this is possible since 
under these assumptions, Xj-m and Yt(x) ^^^ still both in -6(0, x) which is entirely covered 
by the event.) 

Note that by the above discussion, 

p := P( success ) = p{6)q{5)poU^ , (39) 

independently of x > 0. 

If a success did not occur, we say that a failure has occurred. Since the success probability is 
independent of x and the waiting time between two attempts is always stochastically bounded 
by an exponential random variable of the form T{y) (which is a.s. finite), we deduce that 
after a Geometric(p) number N of attempts, success is guaranteed, hence r < oo almost surely. 
Moreover, in the case of failure, consider the mutual distance DrpM^sf^\ between the two lineages 
at time T{x) A S{x). Then -Dt(x)aS(x) < 2x + R{x). From ([37|) we can deduce that there exists 
a random variable R, independent of x and a.s. finite, such that 2 + R{x)/x :< R'va. the sense of 
stochastic domination. Let Ri,R2, ■ ■ ■ be a sequence of i.i.d. random variables with distribution 
R. The strong Markov property and (j36p then show that 

r ^x' 



'{S[C] + S [CR^"] +...+£ [CiRi ■ ■ ■ Rn^] } , 



where £[y] stands for an exponential random variable with parameter y and all the above expo- 
nential random variables are conditionally independent given their arguments. Define Z as the 
random variable within the curly brackets to conclude. D 

Remark 5.4. The system A°° inherits the consistency property from its construction as the 
limit of A^ (this property can also be shown directly from the generator of A°°). Hence, a 
notable consequence of Lemma 15.31 is that any finite sample of lineages finds its most recent 
common ancestor in finite time with probability one. The same kind of behaviour, as well as the 
convergence of the forwards-in-time process to a field of correlated Bernoulli random variables, 
was already observed by Evans in the case where the genealogical process of his continuous sites 
stepping-stone model is a system of one-dimensional independent a-stable motions coalescing 
instantly upon meeting. See §5 in [Eva97j for a full description of his results. However, the 
underlying mechanisms are quite different here. Not only does Lemma [5 . 3 1 hold for any a £ (1, 2) 
and any dimension, which cannot be the case in Evans' framework since two independent stable 
processes may not meet, but even in dimension 1 the way lineages coalesce is different: the limit 
in ()33p shows that two lineages of A°° have no chance to meet, but their coalescence is due to 
the fact that large events of the appropriate size are just frequent enough to catch them even 
when they are very far from each other. As a last consequence, it is then possible to see multiple 
mergers during the evolution of A°^ , which is not the case when the a-stable processes move 
independently of each other and coalesce only when they meet. 

Let us now finish with the proof of Lemma [5. 2 1 and of Theorem 1 1.51 Recall that for any marked 
partition A, 5{A) stands for half the minimum distance between two marks in A {S{A) = -|-oo 
if A has only one block) . 

Proof of Lemma 15.21 Because most of the ideas and computations we shall use to establish 
(|33p are developed in detail in the proof of Lemma 15.31 we only present an outline here and 
refer to that proof for more precise arguments. Since we always deal with partitions of some 
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finite set, it is sufficient to show the result when Aq consists of just two blocks starting at some 
positive separation. 

If X > denotes the initial distance between our two lineages, let us call T{x) the first 
time at which any of the lineages is in the geographical area of an event of radius greater 
than x/4, and let us call S{x) the first time at which the distance between the two lineages 
is greater than 2x, or less than x/2. Notice that the lineages evolve independently until the 
random time T{x) A S{x), since they are hit by events that are necessarily disjoint until that 
time. Moreover, they both move according to the law of a symmetric a-stable process whose 
large jumps have been truncated (see reasoning below (f38l) ). Hence it is not difficult to show 
that S{x) is of the order of x", and so is T{x), while the coalescence rate of two lineages at 
distance x is commensurate with x~'^. Using the more careful analysis performed in the proof 
of Lemma 15.31 we can in fact conclude that the probability pQ that the two lineages coalesce 
before their distance doubles or is divided by two is not only positive, but also independent of 
X. Together with the fact that T{y) A S{y) is a.s. finite for every y > (for reasons expounded 
in Lemma l5.3p . the number of attempts before succeeding to coalesce is a geometric random 
variable with parameter po, which we shall denote by N. 

As a second step, suppose that the lineages fail to coalesce at time T{x) A 5(x). The new 
location of the lineage which jumps at that time (at most one of them jumps, otherwise they 
would coalesce) is uniformly distributed over the area of the event, and since the lineages are 
at distance at least x/2 from each other just before T(x) A S{x) a small calculation using the 
scaling properties of the evolution mechanism shows that the probability 7r(r/) that their new 
distance at that time is less than r]x satisfies 

(a) TT^T]) is independent of x, 

(b) limrf^o IT {r]) = 0. 

As a consequence, if r/ G (0, 1/10) and /c G N, we can write 

P^Q [coal, before distance decreases by 7/ ] > E[(1 — 7r(?7)) ^ 1|jv<A:}]- (40) 

Note in passing that, by monotonicity, the same inequality holds if we replace rj by any e < t] . 
Let us now draw some conclusions from these observations. We fix c > 0, and choose k{c) 
and 77(c) such that for every k > k{c) and r] < r]{c), 

F[N>k]<- and E[{1 - tt{t]))^-^] > I - -. 

Then, using the fact that the event described in the left-hand side of ()40p implies t^ = +00 for 
every e < rj x, we have that, for every such e, 

FAo[te = Oo]>l-^-^ = l-C. 

Since c was arbitrary, (ISSp follows. 

As regards the second part of Lemma 15. 2| recall from (j26p that the operator Q" is defined, 
for every function / of class C^ with compact support and every marked partition A satisfying 
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6{A) > 0, by 

jRd J 5(A) r°+''+^ JB{v,r) Vt ,^ r, ...r.-^. 

\A\ 



fyU 






~{ JK'* JO 

f fiz 

X / V [/(^W(A ^)) - f{A) -{z- x„ V,/(^))l||,_,^|<i}] 

JB(y,r) Vr 
j=l -^IK -^0 r ^ ^ JB{y,r) ^r 



In particular, if ^ = {(61, xi)} contains only one block and if / is a function of its mark only, 
then Q"f{A) is equal to 



00 



'^ {xi€B{y,r)}dr r dz 

r^+'^+l JBi^y^r) Vr 



^ dy 'ZZZi / TT[m-fi^i)-{^-^i,^fi^i)n{\z~.,\<i}] 



f f°° 'i-{xieBiy,r)}dr f dz , ^.r w-, 

JRd JO r ^ ^ JB{y,r) ^r 

^ L.^^l /q ;:i^TdTT ^^^ ) [/(^)-/(^i)-(^-^i'W(^i))l{|.-xi|<i}J 

(ir Vol(S(z,r)nS(xi,r)) 
,-^Td+T Vr 



+ "" L.dz\ I -^^^^ ) (z-xi,V/(xi))l||,_,^|<i|. 



The second term above is zero (by symmetry) and hence the generator can be written as 

g^f{A) = f dz i{z - xi) {f{z) - /(xi)) , (41) 

where the intensity i{w) is given by 

dr Vol(-B(tf,r)nB(0,r)) 



Jo r"+<^+i Vr 

Now, one can check that for any k > 

k i{zk-^''^) d{zk-^/'^) = l{z) dz 

and so the motion of a single lineage is a symmetric a-stable Levy process. 

When there are at least two blocks, as long as 5{A^) > the first term of G^fiA'^) is finite 
and clearly represents the merger and jump at finite rate of several blocks of A°°. However, the 
coalescence rate of two lineages at distance e is equal to 

f f°° dr f°° dr 

""idy -^Td+T ^{-1.-2 6^(2^.-)} = ^/ , ;^TdTT^°K^(^i'0nS(x2,r)) oce-" 

as e — ;■ 0, and so one can prove the existence of the process A^ only up to t^, for any e > 0. 
Yet (j33p is actually more than what is required to invoke Theorem 4.6.3 in |EK86| and complete 
the proof of existence of A^ . D 
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Proof of Theorem II. 5L There is nothing else to do. Duahty and the convergence of A"" give 
us the convergence of p"" exactly as in the proof of Theorem ll.il Lemma [5.31 is sufficient to show 
that (llOp holds and so the limiting densities w^°'' {t, x) are Bernoulli random variables as stated. 

D 
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